Minimalist Grammars and Minimalist 
Categorial Grammars, definitions toward 
inclusion of generated languages 

Maxime Amblard^ 

Nancy Universite - INRIA Nancy-Grand Est 
amblardSlor ia . f r 



Abstract. Stabler proposes an implementation of the Chomskyan Min- 
imalist Program, [I] with Minimalist Grammars - MG, [2]. This frame- 
work inherits a long linguistic tradition. But the semantic calculus is 
more easily added if one uses the Curry-Howard isomorphism. Minimal- 
ist Categorial Grammars - MCG, based on an extension of the Lambek 
calculus, the mixed logic, were introduced to provide a theoretically- 
motivated syntax-semantics interface, |^. In this article, we give full 
definitions of MG with algebraic tree descriptions and of MCG, and take 
the first steps towards giving a proof of inclusion of their generated lan- 
guages. 

The Minimalist Program - MP, introduced by Chomsky, [T], unified more 
than fifty years of linguistic research in a theoretical way. MP postulates that 
a logical form and a sound could be derived from syntactic relations. Stabler, 
[5] , proposes a framework for this program in a computational perspective with 
Minimalist Grammars - MG. These grammars inherit a long tradition of genera- 
tive linguistics. The most interesting contribution of these grammars is certainly 
that the derivation system is defined with only two rules: merge and move. The 
word Minimalist is introduced in this perspective of simplicity of the definitions 
of the framework. If the merge rule seems to be classic for this kind of treatment, 
the second rule, move., accounts for the main concepts of this theory and makes 
it possible to modify relations between elements in the derived structure. 

Even if the phonological calculus is already defined, the logical one is more 
complex to express. Recently, solutions were explored that exploited Curry's 
distinction between tectogrammatical and phenogrammatical levels; for example. 
Lambda Grammars, [1], Abstract Categorial Grammars, [5], and Convergent 
Grammars [B]. First steps for a convergence between the Generative Theory 
and Categorial Grammars are due to S. Epstein, A full volume of Language 
and Computation proposes several articles in this perspective, [8], in particular 
[9], and Cornell's works on links between Lambek calculus and Transformational 
Grammars, [lOj . Formulations of Minimalist Grammars in a Type- Theoretic way 
have also been proposed in [TT] , [T^] , [13] ■ These frameworks were evolved in [T3] , 
13, [E] for the syntax-semantics interface. 

Defining a syntax-semantics interface is complex. In his works. Stabler pro- 
poses to include this treatment directly in MG. But interactions between syntax 



and semantic properties occur at different levels of representation. One solution is 
to suppose that these two levels should be synchronized. Then, the Curry-Howard 
isomorphism could be invoked to build a logical representation of utterances. The 
Minimalist Categorial Grammars have been defined from this perspective: cap- 
ture the same properties as MG and propose a synchronized semantic calculus. 
We will propose definitions of these grammars in this article. But do MG and 
MCG genrate the same language? In this article we take the first steps towrds 
showing that they do. 

The first section proposes new definitions of Minimalist Grammars based on 
an algebraic description of trees which allows to check properties of this frame- 
work, [3]. In the second section, we will focus on full definitions of Minimalist 
Categorial Grammars (especially the phonological calculus) . We will give a short 
motivation for the syntax-semantics interface, but the complete presentation is 
delayed to a specific article with a complete example. These two parts should be 
viewed as a first step of the proof of mutual inclusion of languages between MG 
and MCG. This property is important because it enables us to reduce MG's to 
MCG, and we have a well-defined syntax-semantics interface for MCG. 

1 Minimalist Grammars 

Minimalist Grammars were introduced by Stabler [2] to encode the Minimalist 
Program of Chomsky, [l] . They capture linguistic relations between constituents 
and build trees close to classical Generative Analyses. 

These grammars are fully lexicalized, that is to say they are specified by their 
lexicon. They are quite different from the traditional definition of lexicalized 
because they allow the use of specific items which do not carry any phonological 
form. The use of theses items implies that MG represent more than syntactic 
relations and must be seen as a meta-calculus lead by the syntax. 

These grammars build trees with two rules: merge and move which are trigged 
by features. This section presents all the definitions of MG in a formal way, using 
algebraic descriptions of trees. 

1.1 Minimalist Tree Structures 

To provide formal descriptions of Minimalist Grammars, we differ from tradi- 
tional definitions by using an algebraic description of trees: a sub-tree is defined 
by its context, as in [TB] and [T7]- For example, the figure on the left of the figure 
[IJshows two subtrees in a tree (^i and t2) and their context (Ci and C2). Before 
we explain the relations in minimalist trees, we give the formal material used to 
define a tree by its context. 

Graded alphabets and trees: Trees are defined from a graded set. A graded 
set is made up of a support set, noted S, the alphabet of the tree, and a rank 
function, noted cr, which defines node arity (the graded terminology results from 
the rank function). In the following, we will use E to denote a graded {S,a). 



The set of trees built on S, written Tj;, is the smahest set of strings {S U 
{(; ); , })*. A leaf of a tree is a node of arity 0, denoted by a instead of a(). For 
a tree t, ii t — a(ti, • • • , tj^), the root node of t is written cr . 

Moreover, a set of variables X = {xi,X2, • • •} is added for these trees. Xk is 
the set of k variables. These variables mark positions in trees. By using variables, 
we define a substitution rule: given a tree t € T^(^Xk) (*-^- ^ ^^^^ which contains 
instances of k variables xi, • • • , Xk) and ti, ■ ■ ■ ,tk, k trees in Tj;, the tree obtained 
by simultaneous substitution of each instance of a;i by ii , . . . , a;^ by is denoted 
by t[ti, ■ ■ ■ ,tk]- The set of all subtrees of t is noted St- 

Thus, for a given tree t and a given node n of t, the subtree for which n is 
the root is denoted by t with this subtree replaced by a variable. 

Minimalist trees are produced by Minimalist Grammars and they are built 
on the graded alphabet {<,>,Z'}, whose ranks of < and > are 2 and for 
strings of S. Minimalist Trees are binary ones whose nodes are labelled with < 
or >, and whose leaves contain strings of S. 

Relations between sub-trees We formalise relations for different positions of 
elements in St- Intuitively, these define the concept of be above, be on the right 
or on the left. A specific relation on minimalist trees is also defined: projection 
that introduces the concept of be the main element in a tree. 

In the following, we assume a given graded alphabet S. Proofs of principal 
properties and closure properties are all detailed in The first relation is the 
dominance which informally is the concept of be above. 

Definition 1 Let t E T^, and Ci,C2 G St, Ci dominates C2 (written Ci <* 
C2 ) if there exists C S St such that Ci [C] — C2 . 

Figure [T] shows an example of dominance in a tree. One interesting property 
of this algebraic description of trees is that properties in sub-trees pass to tree. 
For example, in a given tree t, if there exists Ci and C2 such that Ci <* C2, 
using a 1-context C, we could build a new tree t' = C[t] (substitution in the 
position marked by the variable xxi of t). Then, C[Ci] and C[C2] exist (they 
are part of t') such that C[Ci] < C[C2]. 

Definition 2 Let t e Ts, Ci,C2 G St, Ci immediately precedes C2 (written 
Ci ^ C2) if there exists C E St such that: 

1. Ci^C[a{ti,...,tj,xi,tj+2,---,tk)] and 

2. C2 — C[(T(ti, . . . , tj, tj+i, xi, . . . , tj.)]. 

Precedence, written is the smallest relation defined by the following 

rules ( transitivity rule, closure rule and relation between dominance and prece- 
dence relation): 

Ci C2 C2 C3 Ci C2 Ci <\* C2 

[irons] [*] [dam] 

Ci C3 Ci C2 C2 Ci 




— Ci is the context of the sub-tree ti 

— C2 is the context of the sub-tree t2 

— Ci <* C2 means that the root node of 
ti is higher than the root node of f2 in 
the full tree 



— Ci is the context of the sub-tree t\ 

— C2 is the context of the sub-tree t2 

— Ci <* C2 means that the root node of 
ti is on the left side of the root node 
of t2 in the tree 



Fig. 1. Dominance and precedence relations in trees. 



Precedence encodes the relation he on the left (and then he on the right) or 
he ahove another element (using the dominance). These two relations stay true 
for substitution (as mentioned above). 

The next relation does not ck^fine a tree relation. It realises a linguistic prop- 
erty by leading the concept of he the main element in a structure (or a substruc- 
ture). 

Definition 3 Let t £ T^^ci^)' C!i,C2 € St, Ci immediately projects 

on C2 (written Ci < C2) if there exists C G St such that one of the two following 
properties holds: 

1. Ci=C[<{xiM)] andC2 = C[<{ti,xi)], 

2. Ci=C[>{t2,x{)] and C2 = C[>{xuti)], 

in this case C <Ci and C < C2. // Ci<C2 or C2<Ci, then there exists C 
such that C < Ci and C < C2 . 

is the smallest relation defined by the following system of rules: 

C G St C\<'^C2 C2<"^Cs C\<C2 
[0] [trans] 



C\ <\* C2 C3 <* C4 C2<C3 Ci < C2 C2<.Cs 

z [^] z 



Note that the projection relation is transitive. AU the properties of these 
three relations are proven in [3] . The figure [2] presents three minimalist trees 
where in t the main element is the verb walks (which is accessible by following 
the projection relation). 

These three relations could seem quite complicated for a reader who is not 
familiar with these notations or the zipper theory. But their expressiveness allows 
to prove the structural properties assumed for MG and moreover to give the proof 
of languages inclusion with MCG. Finally, in this section, we have defined the 
concept of parent and child relations in trees plus the projection relation which 
defines constituents in linguistic descriptions. 

1.2 Linguistic Structures in Trees 

From the linguistic perspective, trees represent relationships between grammat- 
ical elements of an utterance. Linguistic concepts are associated with minimalist 
tree structures. These relationships have been proposed for the analysis of struc- 
tural analogies between verbal and nominal groups. Thus, groups of words in a 
coherent statement (phrases), whatever their nature, have a similar structure. 
This is supposed to be the same for all languages, regardless of the order of sub- 
terms. This assumption is one of the basic ideas of the X-bar theory introduced 
in the seventies, [18] and in the MP, [1 . 



The head is the element around which a group is composed. An easy way to 
find the head of a minimalist tree is to follow the projection relation of the nodes. 

Definition 4 Let t e Tmg, if for all C E St, C<""C' then C is called the head 
oft. For a given tree t € Tmgj ''^^ write Ht[x\ E St a sub-tree oft of which x is 
the head, and head(t) is a leaf which is the head oft. Then t = Ht[head{t)]. 

For a minimalist tree, there always exists a unique minimal element for the 
projection relation and it is a leaf (which is the head of the tree) [3]. 

For example, the head of the minimalist tree in figure [2] is the leaf walks 
(follow the direction of the projection relation in nodes and stop in a leaf). 
Subtrees have their own head, for example the leaf a is the head of the subtree 
ti (in figure [2]) and the preposition in is the head ot t^. 

Maximal Projection is, for a leaf I, the largest subtree for which I is the 
head. This is the inverse notion of head. In the minimalist tree of figure [2j the 
maximal projection of the leaf walks is the full tree t. To describe other maximal 
projections in this example, the maximal projection of a is the subtree which 
contains a man and the maximal projection of the man is the leaf man. In a 
more formal way, the maximal projection is defined as follows: 



> 




the street 

t ii t2 



Fig. 2. a minimalist tree t and two of its sub-tree 

Definition 5 Let t E Tmg, C E St- The maximal projection of C (denoted 
by proimax{C)) is the subtree defined by: 

- if C ^Xi, projmaxiC) = Xi 

- if C ^ C"[< {xi,t)] orC = C'[> {t,xi)], projmaxiC) ^projmaxiC') 

- if C = C'[< it,x,)] or C = C'[> ixi,t)], projraax {C) = C 

Then projmax{walks) = t. This logical characterization of minimalist trees 
and structural relations allows to prove different properties of MG (for example 
that the projection is anti-symmetric), [3]. 



Complement and Specifier are relations on subtrees with respect to the head. 

Elements coming after the head provide information and they are in the 
complement relation. Let t g Smg-^ Ci is a complement of head{t) = C, if 
projmaxiC) <* Ci and C ^+ Ci, denoted by Ci comp C. 

In the tree t of figure [2j the subtree ^2 is in a complement relation with the 
head walks. It adds information to the verb. 

By contrast, elements placed before the head determine who (or what) is in 
the relationship. Let t G Smg, Ci is a specifier of head{t) = C, if projmaxiC) <\* 
Ci and Ci ~<~^ C, denoted by Ci spec C. 

In the tree t of figure [2] the subtree ti is in a specifier relation with the head 
walks. It specifies interpretation of the verb. 

1.3 Minimalist Grammars 

The computational system of MG is entirely based on features which represent 
linguistic properties of constituents. Rules are trigged by these features and 
they build minimalist trees. A Minimalist Grammar is defined by a quintuplet 

{V, Features, Lex, ^, c) where: 

— is a finite set of non-syntactic features, which contains two sets: P (phono- 
logical forms, marked with / /), and / (logical forms, marked with ()). 

— Features— {B U S* U U L^} is a finite set of syntactic features, 

— Lex is a set of complex expressions from P and Features (lexical items), 

— <P — {merge, move} is the set of generative rules, 



— c £ Features is the feature which allows to accept derivations. 

The final tree of a derivation which ends with acceptance is called a deriva- 
tional tree, which corresponds to a classical generative analysis. Phonological 
forms are used as lexical items (and they could be seen as the grammar's ter- 
minal symbols). A left-to-right reading of phonological forms in derived and 
accepted structures provides the recognized string. But intermediate trees in a 
derivation do not stand for this. Only the derivational tree allows to recognize 
a string. This results from the move rule which modifies the tree structure. For 
a MG G , the language Lq recognized by G is the closure of the lexicon by the 
generation rules. 

1.4 Features 

A MG is defined by its lexicon which stores its resources. Lexical items consist of 
a phonological form and a list of syntactic features. The syntactic set of features 
is divided in two subsets: one for basic categories, denoted B, and one for move 
features, denoted D. Different types of features are: 

— B = {v, dp, c, • • •} the set of basic features. Elements ofB denote standard 
linguistic categories. Note that this set contains c, the accepting feature (I 
assume it is unique at least). 

— S = {=d I d € B} the set of selectors which expresses the necessity of 
another feature of B of the same type (for d ^ B, =d is the dual selector). 

— La = {+k I k G D} the set of licensors. These features assign an expres- 
sion's property to complement another in a specifier-head relation. 

— Le = {—k I k G D} the set of licensees. These features state that the 
expression needs to be complemented by a similar licensor. 

Lexical sequences of features follow the syntax: /FP/ : {S{S U La)*)* B{Le)* 




Fig. 3. Automata of acceptable sequences of features where b ^ B and d E D. 

Vermaat, |19j . proposes an automata which recognises the acceptable se- 
quences, proposed in figure [3] This structure could be divided in two parts: the 
first containing a sequence of selectors and licensors (features which trigger rules, 
as we shall see) , and the second which contains only one basic feature (the gram- 
matical category associated to the expression) and a sequence of licensees. The 



first part corresponds to stat I and II and the second to stat III and transitions 
to this state. In the following, e will denote any feature and E a sequence of 
features (possibly empty). 

For example, the sequence associated with an intransitive verb will be: —d +case v 
which means that this verb must be jointed with a determinal phrase {determi- 
nal comes from the Generative Theory), a complex expression with feature d. 
Then it must be combined with a —case, we will see how in the next section, an 
then there is a structure associated with verb (feature v). 

Transitive verbs will extend the intransitive ones wth the list: 

=d +case ~d +case v 

The two =d correspond to the subject and the object of the verb. The first case 
will be accusative and the second nominative. 

Another example is determiners: they are combined with a noun to build a 
determiner phrase and need to be unified in the structure (see the next section). 
Here is an example of lexicon which contains a verb, a noun and a determiner: 

walks : =d +case v 
a : =71 d —case 
man : n 



1.5 MG Rules 

^, the set of generating rules, contains only: merge and move. A derivation is a 
succession of rule applications which build trees. These trees are partial results: 
the structural order of phonological forms does not need to correspond to the 
final one. In the MP, a specific point, called Spell-Out is the border between 
the calculus of derivations and the final result. Rules are trigged by the feature 
occurring as the first element of list of features of the head. 

Merge is the process which connects different parts. It is an operation which 
joins two trees to build a new one: 
merge : Tmg x Tmg -> Tmg 

It is triggered by a selector {=x) at the top of the list of features of the head 
and it is realised with a corresponding basic feature (x) at the top of the list of 
features of the head of a second tree. Merge adds a new root which dominates 
both trees and cancels the two features. The specifier/complement relation is 
implied by the lexical status of the tree which carried the selector. The new root 
node points to this tree. 

Let t,t' e Tmg be such that t = Ht[l : E] and t' = Ht'[l' : h E'] with 
/i€ B: 

merqe{tt') = [<^^-- ^'^"[^'^ ^'^^ ^ ^ Lex, 

merge^i, i) | ^ ^^^^ . ^^j; . otherwise. 

Figure |4] presents the graphical representation of merge. 



if t G Lex 

t : =h E t' 




merge(t,t') 




h E' 



t' : A merge(t,t') : 




=h E h E' 




move(t) 



Fig. 4. Tree representation of merge and move. 

For example, to derive a man walks, we first need to combine a with man, 
and tlien to combine the resuh with the verb: 



< 



a man 

=^ d —case 



and 



walks 

+case V 




a man 
—case X 



Obtained trees do not verify the word order (only the final tree will check 
the right word order). In this example, the selectors are carried by lexical items, 
then projection relations point to the left in both cases. 



Move encodes the main idea of the Minimalist Program. It corresponds to the 
movement of a constituent at the top position in a derivation. Move is trigged 
by a licensor at the top of the list of features of the head of a tree. Then, 
it looks for a corresponding licensee (— x) at the top of the list of features of 
the head inside the tree. If these conditions are met, the maximal projection of 
the node which carries the licensee is moved to the left of a new root. This node 
points to the right (the subtree which carries the former head) . Both licensor and 
licensee are cancelled. The root of the moved maximal projection is substituted 
by an empty leaf (e). This new leaf is called the trace of the move. 

Figure |4] shows a graphical representation of the move rule where the head 
of C carries a +g in its top features list. Then we look for a leaf with —g in 
its top features list and then find its maximal projection (C2) which contains 
all the elements which depend on it. Finally this sub-tree is moved to the left 
position of a new root node. Intuitively, a linguistic property is checked and the 
consequence is a move in first position in the tree. And strictly: 

move : Tmg ^ Tmg 



For all tree t = C[l : +g E,l' : -g E'], such that t = Ht[l : +g E], there 
exists Ci, C2 G St such that: C2 is the maximal projection of the leaf I' and Ci 
is t deprived of C2. Then, t ^ Ci[l : +g E, C2[l' : -g E']] where: 

- C2[l' : -5 E'] = proj,„„,(C[r : -g E]) 

- Ci[l: +g E, xi] = projmax{C[l : +g E, xi]) 

move{t) = >{C2[l' ■■ E%Ci[l -.E^e]) 

Figure |4] presents the graphical representation of move. 

Stabler introduces some refinements to these grammars. Let us mention them. 
He introduces a second move: weak move, which does not move the phonological 
forms. The precedent move is then called strong move, which is trigged with 
capital features. The weak move is, like strong move: 

move{t) = >(C2[e : E'],Ci[l : E,l']) 

Variations on strong/weak values achieve variations on phonological order. 
This is an instance of the use of parameters of the Minimalist Program. 

Moreover, restrictions can be introduced on MG derivations. An important 
one is the Shortest Move Condition (SMC) which blocks move in case of ambi- 
guity on licensees. Then, the move operation of MG with SMC is deterministic. 

A locality condition could also be introduced: Specifier Island Condition - 
SPIC. "Islands" define areas which prohibit extractions. With SPIC, a subtree 
cannot be moved if it is in a specifier relation within a subtree. This condition was 
introduced by Stabler, in 20] drawing on works of |2l] and [22], who proposes 
that moved elements had to be in a complement relation. 

In the previous example, the head of the last tree is the leaf walks which 
contains a +case feature as first element of its list. Then, a move is trigged 
in the tree with the leaf a which carries a (—case). The resulting tree is the 
following: 



> 




< < 



a man walks e 

^^^eage X ^^^^^^ 

The move operation modifies the position of the maximal projection of the 
leaf which carries the —case. The old position is substituted by an empty leaf 
(e). Finally, the tree contains only one feature which is v. In this small example, 
I did not discuss the validity of the final feature, but in a real derivation, we 
assume that it is not the verb which carries the -\-case licensor which corresponds 
to the nominal case, but it is a specific item. This item corresponds to the 
morphological mark of the verb. Then each acceptable derivation assumes that 
a verb has received its time (and other properties). But exhibiting the use of 



this item needs other refinements of the two rules (Head-movement and Affix- 
Hopping) . 

This section did not propose a new framework for computational linguistics. 
This is a new definition of Stabler proposal. This way, assumed properties of min- 
imalist trees have been fully proved, Moreover this algebraic definition of MG 
is a perfect description to compare generated languages with other frameworks. 
Finally, this modifies the point of view on derivations and shows all steps of the 
calculus as substitution. One missing point is still the introduction of a semantic 
calculus. Let us now develop MCG which are defined with a syntax-semantics 
interface. 

2 Minimalist Categorial Grammars - MCG 

In this section, we define a new Type-Theoretic Framework which is provided 
by the mixed calculus, a formulation of Partially Commutative Linear Logic. It 
proposes to simulate MG and then keep linguistic properties of the Minimalist 
Program. MCG are motivated by the syntax-semantics interface, [3]. This in- 
terface, as for Lambek calculus, is based on an extension of the Curry-Howard 
isomorphism, [23]. Even though this interface is not the aim of this paper, let us 
discuss some important points. 

The idea of encoding MP with Lambek calculus arises from |ljy and ex- 
tended versions of this work. In these propositions, the calculus is always non- 
commutative, a property needed to model the left-right relation in sentences. But 
the move operation could not be defined in a proper way with non-commutative 
relation. In particular, in complex utterances, the non-commutativity implies 
that a constituent (for example the object DP) must be fully treated before 
another one is introduced (for example the subject DP). Otherwise, features 
are mixed and non-commutativity blocks resolutions. It is not acceptable to 
normalize the framework with such a strong property and it makes the system 
inconsistent in regard to linguistics. 

The solution we propose is to define a new framework which allows to deal 
with commutative and non-commutative connectors: the mixed calculus. The 
main consequence on the model of this calculus is that variables in logical for- 
mulae are introduced at different places and must be unified later. In [3] we 
show how the unification is used to capture semantic phenomena which are not 
easily included. In few words, the idea is to consider proofs of mixed calculus as 
phases of a verb. Phases have been introduced by Chomsky to detail different 
modifications which occur on a verb. Several linguists have showed that phases 
have implications on semantics, for example the theta-roles must be allocated 
after a specific phase. This is exactly the result of the syntax-semantics inter- 
face of MCG. Full explanations need more space to be presented, but the main 
contribution of MCG is to propose an efficient syntax-semantics interface in the 
same perspective as MG. 

In this section, we will detail MCG and expose their structural link with MG. 
First we present the mixed calculus, then we give definitions of MCG and show 



proofs of the mixed calculus produced by MCG (together with their linguistic 
properties). 



2.1 Mixed calculus 

MCG are provided with mixed calculus, [21], a formulation of Partially Commu- 
tative Linear Logic. Hypotheses are either in a non-commutative order (<; >) or 
in a commutative one ((, )) The plain calculus contains introduction and elimi- 
nation rules for: 

— the non-commutative product 0: 

a^aqb r,< A;B >.r' h c Ah a rhB 

■ [Qe] [Q; 

r,A,r'hc < A;r >h aq B 

— its residuals (/ and \): 

ThA AhA\C AhA/C BhA 

<r-A>hC '^'^ <A;r>hC 
<A;r>hC <r;A>hC 

rhA\C FhC/A 

— the commutative product ®'. 

AhA(g)B r,{A,B),r'hC AhA FhB 

r,A,r'hC ^ ^ {A,F)hA®B' 

— its residual 

FhA AhA^C iA,r)hC 
{r,A)hC ^^"^ FhA^C 

The product connectors of the mixed calculus use in a first step hypotheses 
to mark positions in the proof and in a second one substitute the result of an an- 
other proof in these positions using a product elimination (the commutative/non- 
commutative status depends on relations between hypotheses). This is exactly 
the process we will use to define the move rule of MCGs. 

Moreover, the calculus contains an axiom rule and an entropy rule. This last 
one allows to relax the order between hypotheses. We will use this rule to define 
merge in MCG as we will see in the following section. 

r • 1 rhc 

A\- A ^"^"^^^"^^ — ; [entropy — whenever F' C F] 

F \~ C 

This calculus has been shown to be normalizable, [25) and derivations of 
MCG will be proofs of the mixed calculus in normal form. 



2.2 Minimalist Categorial Grammars 

As MG, MCG are lexicalized grammars. Derivations are led by formulae associ- 
ated with lexical items built with connectors of the mixed logic. They are specific 
proofs of the mixed logic, labelled to realise the phonological and semantic tiers. 
Phonological labels on proofs will be presented with definitions of MCG rules. 
A MCG is defined by a quintuplet {N,P, Lex, C) where : 



— is the union of two finite disjoint sets Ph and / wliich are respectively the 
set of phonological forms and the one of logical forms. 

— P is the union of two finite disjoint sets Pi and P2 which are respectively the 
set of constituent features (the set B of MG) and the one of move features 
(the set D of MG). 

— Lex is a finite subset of i? x F x /, the set of lexical items 

— <P = {merge, move} is the set of generative rules, 

— C e P is the accepting formulae. 

As mentioned in the previous section, move is defined using a product elim- 
ination. In MG, a constituent is first introduced in a tree using its basic feature 
and then can be moved using its licensees. In MCG, a constituent will be intro- 
duced only when all its positions (which correspond to the basic feature and its 
licensees) have been marked in the proof by specific hypotheses. But we need 
to distinguish the type of the basic feature from the licensees features. That is 
why P is divided in two subsets Pi and P2. This sub-typing of formulae is used 
to well define lexicons of MCG. 

The set E is Ph*, and the set F, the set of formulae used to build Lex, is 
defined with the set P, the commutative product and the two non-commutative 
implications / and \ . Formulae of F are recognized by the non-terminal L of the 
following grammar: 

l::= (b)/pi |c 

B ::=Pi\(b) |p2\(b) | C 

C P2 (E) (c) I Ci 

Ci Pi 

In more details, MCG formulae start with a / which is followed by a se- 
quence of \. This sequence contains operators allowing to compose the proof 
with another one (operators are the translation of selectors and licensors). Lex- 
ical formulae are ended by a sequence of (8). To sum up, these formulae have the 
structure (c„i\ . . . \ci\(6i ® . . . (g)&„ (g) a))/d, with a e Pi, bi e P2, Cj £ P and d G 
Pi. This structure corresponds to the two parts of the list of features we have 
mentioned in the previous section. 

For the example a man walks, the MCG's lexicon is the following: 

walks : case\v/d 

a : [case ® d)/n 
man : n 

Licensees, which express the need for an information, are there seen as a 
specific part of the basic feature (a part of the main sub-type). Licensors will 
be cancelled with an hypothesis to mark a position in the proof. Distinction 
between them is not written by an ad hoc marker but by structural relations 
inside the formula. Before we explain the move and merge rules, let us present 
the phonological tiers. 

^ In the following. Lex is a subset oi E x F. The semantic part is used for the syntax- 
semantics interface which is not detailed here. 



2.3 Derivations 



Labels. Derivations of MCG arc labelled proofs of the mixed calculus. Before 
defining labelling, we define labels and operations on them. 

Let V be an uncountable and finite set of variables such that: PhOV = 0. T is 
the union of Ph and V. Wc define the set S, called labels set as the set of triplets 
of elements of T* . Every position in a triplet has a linguistic interpretation: they 
correspond to specifier/head/complement relations of minimalist trees. A label 
r wiU be considered as r = (rspec, r header comp)- 

For a label in which there is an empty position, we adopt the following nota- 
tion: T—fiead ~ (^speci ^7 '^comp)i "^—spec — "^headi ^comp)? ^—comp ~ i'^spec^ ^head: 

We introduce variables in the string triplets and a substitution operation. They 
are used to modify a position inside a triplet by a specific material. Intuitively, 
this is the counterpart in the phonological calculus of the product elimination. 
The set of variables with at least one in r is denoted by Var{r). The number of 
occurrences of a variable x in a string s £ T* is denoted by \s\x, and the number 
of occurrences of a; in r by ipx{r)- A label is linear if for all x in V, ^Pxif) < 1. 

A substitution is a partial function from V to T* . For a a substitution, s a 
string of T* and r a label, we note s.a and r.a the string and the label obtained by 
the simultaneous substitution in s and r of the variables by the values associated 
by a (variables for which a is not defined remain the same). 

If the domain of definition of a substitution (J is finite and ecjual to x\ , . . . , 
and a{xi) = ti, then a is denoted by [ti/xi,. . . , tn/xn]- Moreover, for a sequence 
s and a label r, s.a and r.a are respectively denoted s[ti/xi, . . . ,tn/xn] and 
r[ti/xi, . . . , tn/xn]. Every injective substitution which takes values in V is called 
renaming. Two labels ri and r2 (respectively two strings Si and S2) are equal 
modulo a renaming of variables if there exists a renaming a such that ri.a = r2 
{resp. si.a = S2). 

Finally, we need another operation on string triplets which allows to combine 
them together: the string concatenation of T* is noted •. Let Concat be the 
operation of concatenation on labels which concatenates the three components 
in the linear order: for r £ S, Concat{r) = rgpec • ^head • i~comp- 

We then have defined a phonological structure which encodes specifier/comple- 
ment /head relations and two operations (substitution and concatenation) . These 
two operations will be counterparts in the phonological calculus of merge and 
move. 

Labelled proofs. Before exhibiting the rules of MCG, the concept of labelling 
on a subset of rules of the mixed logic is introduced. Minimalist logic is the 
fragment of mixed logic composed by the axiom rule, \e, /«, (Se and C 

For a given MCG G ~ {N,p,Lex,<l),C), let a G -background he x : A with 
X £ V and ^ G F, or (Gi;G2) or else (Gi,G2) with Gi and G2 some G- 
backgrounds which are defined on two disjoint sets of variables. G-backgrounds 
are series-parallel orders on subsets of 1^ x _F. They arc naturally extended to the 
entropy rule, noted C A G-sequent is a sequent of the form: P \-c {rs,rt, Tc) : B 
where -T is a G-background, B G F and {rs,rt,rc) € E. 



A G-labelling is a derivation of a G-sequent obtained with the following rules: 

(s, A) e Lex 



l-G (e,s,e) : A 



[Lex] 



\axiom\ 



X : A\-G (e, X, e) : A 
r^GTi-.A/B A\-Gr2: B Var{ri) n Var{r2) 

{L; A) \-G {ris, rubric • Concat{r2)) : A 
A\-Gr2:B /"he n : 5 \ A Var{ri) nVar{r2) 



Ue 



[\e 



{r-, A) \-G {Concat{r2) • ri^, rit, n^) : A 

r^GTi: A®B A[x: A,y: B]'^Gr2--C Variji) f]Var{r2) = % A €¥2 
A[r] he r2[Concat{r^)/x,t/y\ : C 

r he r : A r c r 



r"^Gr: A 

Note that a G-labcUing is a proof tree of the minimalist logic on which 
sequent hypotheses are decorated with variables and sequent conclusions are 
decorated with labels. Product elimination is used with a substitution on labels 
and implication connectors with concatenation (a triplet is introduced in another 
one by concatenating its three components). 

li r \-G r : B is & G-sequent derivable, then r is linear, and Var{r) is ex- 
actly the set of variables in F. Finally, for all renamings cr, r.a he r.a : B is a 
G-sequent differentiable. 



Merge and Move rules are simulated by combinations of rules of the mini- 
malist logic producing G-labeling. 

Merge is the elimination of / {resp. \) immediately followed by an entropy 
rule. The meaning of this rule is joining two elements in regard to the left-right 
order (then non-commutative connectors are used) and, as mentioned earlier, 
all hypotheses must be accessible. To respect this, a commutative order between 
hypotheses is needed. Then an entropy rule immediately follows each implication 
elimination. 

For the phonological tier, a label is concatenated in the complement (respec- 
tively specifier) position in another one. Note that a m,erge which uses / must 
be realized with a lexical item, so the context is always empty. 

I~ (j'specTTheadTTcomp) '■ A / B A h S : B ^ 

A \- {rspec, rhead, Tcornp • C Oncat{s)) - A 
[1=] 

^ I- {r spec, r head, rcomp • Concat{s)) : A 



Z\ h S : 5 r \- {Vspec^ f^head^ f^comp) '• B \ A 

I \e J 

{A; r) h {Concat{s) • rspec,rhead,rcomp) ■ A 

A,rV {Concat{s) • r^pec, rhead, Tcomp) ■■ A 

These combinations of rules are noted [mg] . 

For example, the proof of the utterance a man walks begins with the formulae 
of walks: case\v/d. The first step of the calculus is to introduce two hypotheses, 
one for d and the other for case. The result is the following proof: 

h (e, walks, e) : case\v/d u : d \^ {e, u, e) : d 
V : case h (e, v, e) : case u : d\- (e, walks, u) : case\v 

{v : case, u : d) \- (e, walks, u) : v 
In parallel, the derivation joins the determiner a and the noun man: 

h (e, a, e) : {case ®d)/n h (e, man, e) : n 

[mg] 

\- (e, a, man) : case (g) d 

Note that the first proof contains two hypotheses which correspond to the 
type of the main formula in the second proof. The link between these two proofs 
will be made by a move, as we will show later. 

Move is simulated by an elimination of a commutative product in a proof 
and, for the phonological calculus, is a substitution. We have structured the 
lexicons and the merge rule to delay to the move rule only the substitution part 
of the calculus. 

r^ri:A^B A[u : A,v : B]^ r2 : C 



A[r]\- r2[Concat{ri)/u,e/v]: C 

This rule is applied only if ^ e P2 and B is of the form Bi x ... B„ x D 

where Bi G P2 and D G Pi. 

This rule is noted [mv]. Move uses hypotheses as resources. The calculus 
places hypotheses in the proof, and when all hypotheses corresponding to a 
constituent are introduced, this constituent is substituted. The hypothesis Pi is 
the first place of a moved constituent and hypotheses of P2 mark the different 
places where the constituent is moved or have a trace. 

In recent propositions, Chomsky proposes to delay all moves after the real- 
isation of all merges. MCG could not encode this but contrary to MG where a 
move blocks all the process, in MCG merge could happen, except in the case of 
hypotheses of a given constituent shared by two proofs which must be linked by 
a move. 

In our example, we have two proofs: 



— one for the verb: (v : case, u : d) \~ (e, walks, u) : v 

— one for the DP: h (e, a, man) : case <Si d 

The first hypothesis corresponds to the entry position of the DP in MG 
and the second to the moved position. Here, we directly introduce the DP by 
ehminating the two hypotheses in the same step: 



h (e, a, man) : case <^ d {v : case, u : d) \- (e, walks, u) : v 

[mv] 

h (a man, walks, e) : v 



The phonological result is a man walks. The proof encodes the same structure 
as the derivational tree of MG (modulo a small transduction on the proof). 

For cyclic move (where a constituent is moved several times) all hypotheses 
inside this move must be linked together upon their introduction in the proof. For 
this, when a new hypothesis A is introduced, a [mv] is applied with a sequent with 
hypothesis A^B h A^B where A is in P2 and B is of the form Bi^. . .(E)Bn<S)D 
where Bi e P2 and Z) e Pi. 



X : A® Bh {e,x,e) : A(^ B A[u : A,v : B] \- r : C 
A[A(g) B]h r[x/u,e/v] : C 

In the definition of merge, the systematic use of entropy comes from the def- 
inition of move. As it was presented, move consumes hypotheses of the proof. 
But, from a linguistic perspective, these hypotheses could not be supposed in- 
troduced next to each other. The non-commutative order inferred from \e and 
/e blocks the move application. To avoid this, the entropy rule places them in 
commutative order. In MCG, all hypotheses are in the same relation, then to 
simplify the reading of proofs, the order is denoted only with 

The strong/weak move could be simulated with the localization of the sub- 
stitution (if hypotheses are in Pi or P2). 

s:r\-A(E)B r[u,v] : A[u : A,v : B]\- C 

— — — [movestrong] 

r[Concat{s)/u, t/v] : A[r] h C 
s:r\-A(E)B r[u,v] : A[u : A,v : B]\- C 



r[e/u,Concat{s)/v] : A[r] h C 



Imove^ 



This version of move is quite different from the one presented for MG, but is 
close to one developed for later MG such as pSj . 

The main difference between MG and MCG comes from move: in MCG, 
constituents do not move but use hypotheses marking their places. MCG uses 
commutativity properties of mixed logic and see hypotheses as resources. To sum 
up, the derivation rules of MCG is the following set of rules: 



{s,A) e Lex 



[Lex] 



^ {fspec: fhead') fcomp) '■ -A / B Zi h S : -B 



[mg] 



\-G (e,s,e) : A 



^ I- {rspec, Thead, Tcomp • Concat{s)) : A 



Z\ h S : -B _r h (jTspec: Theadi Tcornp) '■ B \ A 



[mg] 



A,r\~ {Concat{s) • r^pec, rhead, rcomp) ■ A 



r\-ri:A(»B A[u : A,v : B] \- r2 : C 



[mv] 



A[r] h r2[Concat{ri)/u,e/v] : C 



The set Dg of recognized derivations by a MCG G is the set of proofs obtained 
with this set of rules and for which the concluding sequent is h r : C. The 
language generated by G is L{G) = {Concat{r) \ h r : C e Dg}. 

These derivations do not formally conserve the projection relation (nor the 
specifier, head and complement relations). These principles arc reintroduced with 
strings. However, the head of a proof could be seen as the principal formula of 
mixed logic, and then by extension, the maximal projection is the proof for which 
a formula is the principal one. Specifi.er and complement are only elements on 
the right or left of this formula. 

An interesting remark is that rules of MCG do not use the introduction rule 
of the mixed calculus. This way, they only try to combine together formulae 
extracted from a lexicon and hypotheses. As in MG where a derivation cancels 
features, the MCG system only consumes hypotheses and always reduces the 
size of the main formula (only the size of the context could increase) . This corre- 
sponds to the cognitive fact that we stress the system in the analysis perspective. 
Introduction rules could be seen as captured by the given lexicon. But, because 
of the strong structure of the items, we directly associate formulae and strings. 

We have presented all the MCG rules and lexicon, and illustrated them with 
a tiny example which encodes the main properties of this framework. 

3 Conclusion 

In this article, we propose new definitions of MG based on an algebraic descrip- 
tion of trees. These definitions allow to check properties of this framework and 
moreover give a formal account to analyse links with other frameworks. Then, 
we give the definitions of MCG, a Type-Theoretic framework for MG. In this 
framework, merge and move are simulated by rules of the mixed logic (an ex- 
tension of Lambek calculus to product and non-commutative connectors). The 
phonological calculus is added by labelling proofs of this logic. 

The main contribution of MCG is certainly its syntax-semantics interface. 
This calculus is synchronized on proofs of MCG. But more technical details are 
needed to present this interface and the linguistic properties which it encodes. 
We delay the presentation of this interface to a future presentation. 

Finally, the syntax-semantics interface of MCG should be used under the 
condition they keep properties of MG. This is the aim of another future article 



which will present the proof of inclusion of MG generated languages in MCG 
generated languages. To prove this property, two alternative representations of 
MG and MCG derivations are introduced: alternative derived structures and 
split proofs and the corresponding merge and move. These structures and rules 
make the gap between the two kinds of derivations. They need technical details 
and more space to be presented. 

Definitions and proofs could be easily extended to refinements of merge: Affix- 
Hopping and Head-Movement bc;c;ause those operations derived the same strings 
in both structures. But we have not included these rules in this presentation. On 
another hand, the proof of inclusion presented here does not include the SMC. 
The interpretation of SMC in MCG must be better defined before being included 
in such perspective. The generative power of these grammars with shortest move 
condition is still open. 

This article is a first step to several perspectives which make a strong link 
between a well defined framework with many linguistic properties and a new one 
which captures this framework and proposes a syntax-semantics interface. 
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